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0 Artificial chromosome vector. 



© The present invention relates to a recombinant DNA molecule which contains the telomere of a higher 
eukaryote, particularly a plant, the telomere itself, a method of producing a polypeptide in a recipient cell which 
utilizes said recombinant DNA molecule, host ceils transformed with said recombinant molecule, and uses for 
said recombinant molecule. 
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ARTIFICIAL CHROMOSOME VECTOR 



Field of the Invention 



This invention relates to the field of plant recombinant genetics. 

5 

Background of the Invention 



70 The development of recombinant DNA techniques has opened up the possibility of making specific 
changes in an organism's genetic makeup. Alteration of genetic endowment can lead to the development of 
useful strains of microorganisms, and more productive varieties of domesticated plants and animals. For 
example, insect resistant plants have recently been produced by the addition of a bacterial gene which 
instructs the recipient plants cells to produce a protein toxic to certain types of insects (see Vaeck, et al., 

15 Nature 328:33 (1987)). Manipulation of plant species through genetic engineering will become an important 
complement to classical breeding techniques in the development of plant varieties with new traits, such as 
improved nutritional quality, productivity, disease resistance, and drought and salinity tolerance. 

Genetic engineering involves two basic processes: (1) isolation and propagation of new or altered genes 
(molecular cloning), and (2) the instruction of these genes into the recipient organism in a form that allows 

20 the introduced genetic information to be read (i.e., expressed) and transmitted to successive generations. 
The basic techniques of molecular cloning are well established, but the necessary tools for accomplishing 
the efficient introduction and expression of new genetic information in higher eukaryotic organisms are still 
limited. 

Two general approaches are used to introduce a new genetic information into cells, a procedure 

25 commonly referred to as "genetic transformation." One approach is to introduce the new genetic informa- 
tion as part of another DNA molecule, referred to as a "vector," which can be maintained as an independent 
unit (i.e., an episome) apart from the chromosomal DNA molecule(s). Episomal vectors contain all the 
necessary DNA sequence elements required for DNA replication and maintenance of the vector within the 
cell. Many episomal vectors are available for use in bacterial ceils (for example, see Maniatis, T M et al., 

30 Molecular Cloning: A Laboratory Manual (Cold Spring Harbor, 1982)). However, only a few episomal vectors 
that function in higher eukaryotic cells have been developed. The available higher eukaryotic episomal 
vectors are based on naturally occurring viruses and most function only in mammalian cells. In higher plant 
systems there are no known doubie-stranded DNA viruses that replicate through a double-stranded 
intermediate upon which an episomal vector can be based. Although an episomal plant vector based on the 

35 Cauliflower Mosaic Virus has been developed, its capacity to carry new genetic information is limited 
(Brisson et al., Nature 310:511 (1984)). 

The other general method of genetic transformation involves integration of the introduced DNA 
sequences into the recipient cell's chromosomes which permits the new information to be replicated and 
partitioned to the cell's progeny as a part of the natural chromosomes. The most common form of 

40 integrative transformation is called "transfection" and is frequently used in mammalian cell culture systems. 
Transfection involves introduction of relatively large quantities of deproteinized DNA into cells. The 
introduced DNA usually is broken and joined together in various combinations before it is integrated at 
random sites into the cell's chromosome (see, for example Wigler. et al., Cell 11:223 (1977)). A common 
problem with this procedure is the rearrangement of introduced DNA sequences (see Shingo, K., et al., Mol. 

45 Cell. Bioi. 6:1787 (1986)). A more refined form of integrative transformation can be achieved by exploiting 
naturally occurring viruses that integrate into the host's chromosomes as part of their life cycle (e.g., 
retroviruses) (see Cepko, C, et al., Cell 37:1053 (1984)). 

The most common genetic transformation method used in higher plants is based on the transfer of 
bacterial DNA into plant chromosomes that occurs during infection by the phytopathogenic soil bacterium 

so Agrobacterium (see Nester et al., Ann. Rev. Plant Phys. 35:387-413 (1984)). By substituting genes of 
interest for the naturally transferred bacterial sequences (called T-DNA), Investigators have been able to 
introduce new DNA into plant ceils. However, even this more "refined" integrative transformation system is 
limited in three major ways. First, DNA sequences introduced into plant cells using the Agrobacteriumrr- 
DNA systems are frequently rearranged (see Jones, et al., MoL Gen. Genet. 207:478 (1987)). Second, the 
expression of the introduced DNA sequences varies between individual transformants (see Jones et at., 
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EMBO J. 4:2411-2418 (1985)). This variability is presumably caused by rearranged sequences and the 
influence of surrounding sequences in the plant chromosome (i.e., position effects). A third drawback of the 
Agrobacterium/T-DNA system is the reliance on a "gene addition" mechanism: the new genetic information 
is added to the genome (i.e., all the genetic information a cell possesses) but does not replace information 

5 already present in the genome. While gene addition is suitable for many applications, the ability to actually 
replace a specific gene with an altered copy via homologous recombination (i.e., recombination between 
DNA of the same or a simitar sequence) would be extremely useful. Gene replacement in mammalian cells 
using a transfection protocol has been attempted, but the procedure is inefficient (approximately 1 out of 
every 1000 transformed cells underwent a gene replacement event in Thomas et aL Cell 44:419 (1986); see 

10 also Smithies, et al. ( Nature 317:230 (1985)). 

The present invention discloses linear episomal transformation vectors, based on natural chromosomes, 
that can replicate and be stably maintained in higher plant cells. These artificial plant chromosome vectors 
will provide a versatile tool for genetic transformation of plant species and solve many of the problems 
associated with present DNA transformation technology. In addition, development of artificial plant chro- 

75 mosome vectors will facilitate the construction of artificial chromosomes that can function in other higher 
eukaryotic cells. 

Artificial chromosomes are man-made linear DNA molecules constructed from essential cis-acting DNA 
sequence elements that are responsible for the proper replication and partitioning of natural chromosomes 
(see Murray et ai., Nature 301:189-193 (1983)). These essential elements are: (1) Autonomous Replication 

20 Sequences (ARS) (have properties of replication origins, which are the sites for initiation of DNA replication), 
(2) Centromeres (site of kinetochore assembly and responsible for proper distribution of replicated 
chromosomes at mitosis and meiosis), and (3) Telomeres (specialized structures at the ends of linear 
chromosomes that function to stabilize the ends and facilitates the complete replication of the extreme 
termini of the DNA molecule). 

25 At present, these essential chromosomal elements have been isolated only from lower eukaryotic 
species. ARSs have been isolated from the unicellular fungi Saccharomyces cerevisiae (brewer's yeast) and 
Schizosaccharomyces pombe (see Stinchcomb et ai., Nature 282:39-43 (1979) and Hsiao et al. f J. Proc. 
Natl. Acad. Sci. USA 76:3829-3833 (1979)). ARSs behave like replication origins allowing DNA molecules 
that contain the ARS to be replicated as an episome after introduction into the cell nuclei of these fungi. 

30 Although plasmids containing these sequences replicate, they do not segregate properly. 

Functional centromeric (CEN) sequences have been purified from S. cerevisiae (see Clark et al. t Nature 
287:504-509 (1980) and Stinchcomb et al.. J. Molec. Biol. 158:157-179 (1982)). Episomes carrying the yeast 
CEN sequences display proper segregation into daughter yeast cells during mitosis and meiosis, in contrast 
to ARS plasmids lacking a centromere. 

35 Telomeres, the last chromosomal element in lower eukaryotes to be cloned, are thought to be involved 
in the priming of DNA replication at the chromosome end (see, for example, Blackburn et al., Ann. Rev. 
Biochem, 53:163-194 (1984)). This is because conventional DNA polymerases are template dependent, 
synthesize DNA in the 5 to 3' direction, and require an oligonucleotide primer to donate a 3 OH group. 
When this primer is removed, unreplicated single-stranded gaps arise; most of these gaps can be filled in 

40 by priming from 3' OH groups donated by newly replicated strands located at the 5 end of the gap. 
However, the unreplicated gaps which lie next to the extreme S end of the DNA duplex cannot be primed 
in this manner. Consequently, telomeres must provide an alternative priming mechanism. 

Telomeres are also responsible for the stability of chromosomal termini. Telomeres act as "caps," 
suppressing the recombinogenic properties of free, unmodified DNA ends (see Blackburn, supra). This 

45 reduces the formation of damaged and rearranged chromosomes which arise as a consequence of 
recombination-mediated chromosome fusion events. 

Telomeres may also contribute to the establishment or maintenance of intranuclear chromatin organiza- 
tion through their association with the nuclear envelope (see, for example, Fussell, CP., Qenetica 62:192- 
201 (1984)). 

so Telomeric or telomeric-like DNA sequences have been cloned from several lower eukaryotic organisms, 
principally protozoans and yeast. The ends of the Tetrahymena linear DNA plasmid have been shown to 
function like a telomere on linear plasmids in S. cerevisiae (see Szostak, J.W., Cold Spring Harbor Symp. 
Quant. Biol. 47:1187-1194 (1983)). A telomere from the flagellate Trypanosoma has been cloned (see, for 
example, Blackburn et al.. Cell 36:447-457 (1984). A yeast telomeric sequence has been identified (see, for 

55 example, Shampay et al., Nature 310:154-157 (1984)). 

None of the essential components, including the telomeres, however, function in higher eukaryotic 
systems. For example, there have been numerous attempts to isolate ARSs from other eukaryotes by 
selecting for pieces of DNA that will serve as an ARS in yeast. While such DNA fragments can be readily 
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identified, they do not promote extrachromosomal replication in cells from the donor organism. 

DNA molecules carrying ARSs that function in yeast ceils do not promote extrachromosomal replication 
of these molecules in mouse cells (see Roth et al., Mol. Cell, Biol. 3:1898-1908 (1983)). An ARS sequenced 
from cultivated tomato, which operates in yeast, fails to function in tomato cells (see Zabel, P., "Toward the 

5 Construction of Artificial Chromosomes for Tomato," in: Molecular Form and Functions of the Plant 
Genome, (Plenum Press (1985) and Jongsma et al.. Plant Molec. Biol. 8:383-394 (1987)). Similarly, yeast 
CEN sequences do not function when introduced into mouse (Roth et al. Mol. Cell. Biol. 3: 1878 (1983)) or 
Aspergillus chromosomes (Boylan, et al. a Mol. Cell. Biol. 6:3621 (1986)). In addition, telomeres from the 
protozoan Tetrahymena do not function in cells of the vertebrate Xenopus (Yu, et a!., Gene 56:313 (1987)). 

io Finally, although researchers were able to show that a S. pombe chromosome can replicate at a reduced 
efficiency in mouse cells, the centromeres of this lower eukaryote apparently do not function in the higher 
eukaryote nucleus (see Allshire et al., Cell 50:391-403 (1987)). 

Artificial chromosomes have been constructed in yeast using the three cloned essential chromosomal 
elements. Murray et al., Nature 305:189-193 (1983), disclose a cloning system based on the in vitro 

is construction of linear DNA molecules that can be transformed into yeast, where they are maintained as 
artificial chromosomes. These artificial yeast chromosomes contain cloned genes, replicators, centromeres 
and telomeres but have an impaired centromeric function in short (less than 20kb) artificial chromosomes. 

The ability to construct artificial chromosomes that function In yeast, however, does not teach one 
skilled in the art how to construct an artificial chromosome using the essential elements which will function 

20 in higher eukaryotes. The cells of higher eukaryotlc organisms differ from yeast cells in ways that place 
many further demands on the functioning of their genes. The amounts of DNA in higher eukaryotes is large, 
varying between species in ways that are not yet completely understood. They have more DNA and their 
genomes are composed of different classes of sequences: true gene regions interrupted by numerous and 
lengthy introns, structural sequences necessary for chromosome shorting during cell division, and various 

25 kinds of repetitive sequences. 

Lower eukaryotic systems were the only source of the artificial chromosome essential elements until the 
teaching of the present invention. It is clear that the construction of an efficient, functional artificial plant 
chromosome requires the isolation of the three essential elements from plant chromosomes, and this novel 
methodology and the elements derived therefrom are taught by the present invention. 

30 

Summary of the Invention 

35 The present invention provides for the construction of a recombinant molecule which contains the 
telomere of a higher eukaryote. 

The invention also provides for a method of producing a polypeptide which comprises cloning a gene 
sequence capable of producing the polypeptide into a recombinant molecule which contains a telomere of a 
higher eukaryote, introducing the recombinant molecule containing the cloned gene sequence into a 

40 recipient cell, and incubating the cell such that it produces the polypeptide. 

This invention also discloses a novel method of isolating and enriching the telomeric clones of higher 
eukaryotic organisms by using genetic engineering methods in a suitable manner described hereinafter. 

The invention provides an alternate method for the transformation of plant ceils by constructing 
functional artificial chromosome vectors. 

45 Artificial chromosomes that function in higher eukaryotes have never been constructed. The present 
invention discloses for the first time the molecular cloning of telomeres from A. thaliana, a small flowering 
plant in the mustard family. Before now, no essential chromosomal element had been isolated from a higher 
eukaryote. The cloned A. thaliana telomere serves as a starting point for the construction of a functional 
artificial plant chromosome. 

so Use of the artificial chromosome vector of the present invention solves many of the problems 
associated with the current transformation technology used to introduce new DNA into higher eukaryotic 
cells. Since artificial chromosomes are maintained in the cell nucleus as independently replicating DNA 
molecules, sequences introduced on such vectors are not subject to variable expression due to integration 
position effects. 

55 The problem of DNA rearrangement which occurs when sequences are introduced by transfection 
protocols or using the Agrobacterium/T-DNA system can be avoided with the present invention. Integrative 
transformation systems generally involve introduction of DNA in recombinogenic form (e.g., DNA with free 
ends) and selection for transformants that splice together the Incoming sequences and the host chro- 
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mosome (see Orr-Weaver et al M Proc. Natl. Acad. ScL USA 78:6354-6358 (1981)). Since an artificial 
chromosome can be delivered to the plant nucleus as an intact, unbroken, double-stranded DNA molecule 
with telomeric ends, DNA can be maintained stably in that form and rearrangements should not occur. 

Artificial chromosome vectors can also be used to perform efficient gene replacement experiments. At 

5 present, gene replacement has not been demonstrated in a plant system and has only been detected at low 
frequency in mammalian tissue culture systems (see Thomas et al., Cell 44:419-428 (1986) and Smithies et 
al.. Nature 317:230-234 (1985)). The reason for this is the high frequency of illegitimate nonhomologous 
recombination events relative to the frequency of homologous recombination events (the latter are 
responsible for gene replacement). Artificial chromosomes participate in homologous recombination pref- 

10 erentially. Since the artificial chromosomes remain intact upon delivery, no recombinogenic broken ends will 
be generated to serve as substrates for the extremely efficient illegitimate recombination machinery. Thus, 
the artificial chromosome vectors disclosed by the present invention will be stably maintained in the nucleus 
through meiosis and available to participate in homology-dependent meiotic recombination. 

In addition, because in principle, artificial chromosomes of any length could be constructed using the 

75 teaching of the present invention, the vectors could be used to introduce extremely long stretches of foreign 
DNA into cells. 

Another new and useful feature of the present artificial chromosome transformation system is the 
absence of host range limitations; foreign DNA can be delivered to plant cells directly without the use of 
Agrobacterium. 

20 

Description of the Figures 

25 Figure 1 shows schematically the procedure used for constructing the primary telomeric libraries 

which produce random, non-telomeric clones and telomeric clones, and the procedure used for enriching for 
the telomeric clones produced by the two independent primary libraries. 

Figure 2 shows a Southern blot representation of the hybridization of the pAtT4 insert to Bal3l- 
treated A. thaliana nuclear DNA. 

30 Figure 2(A) shows A. thaliana nuclear DNA treated with Bal31 for 0 (lane 1) 5 (lane 2), 15 (lane 3), 30 

(lane 4), or 50 minutes (lane 5), and subsequently digested with Mbol (left lanes) or BamHI (right lanes). 
The DNA was sized-fractionated by electrophoresis through a 0.8% agarose gel and transferred to a nylon 
membrane. The membrane was probed with radiolabeled pAtT4 and washed at high stringency. 

Figure 2(B) shows the membrane shown in (A), rehybridized with radiolabeled pARR16, an A. thaliana 

35 rDNA clone, and washed at high stringency. 

Figure 3(A) depicts the chromosome configuration in a telotrisomic mutant. The extra telocentric 
chromosome has two telomeres: one identical to that present on the full length chromosomes, and a new 
telomere added adjacent to the centromere. A hypothetical restriction map shown under the linear 
chromosomal DNAs illustrates the expected correlation between the telotrisomic genotype and the presence 

40 of a novel telomeric restriction fragment. 

Figure 3(B) shows a Southern blot representation of three disomic and three telotrisomic plants from 
the progeny of a Tr5A telotrisomic parent probed with radiolabeled pAtT4 and washed at high stringency. A 
novel, polymorphic 15kb Dral band, present in TrSA DNA but absent from wildtype disomic (wt) DNA, is 
marked with an arrowhead. 

45 Figure 4 shows the DNA sequence of the pAtT4 insert. Insert sequences are shown in larger font and 

flanking vector sequences displayed in smaller font. The fusion of the blunt cloning site of the vector to the 
telomeric sequences occurs after the underlined sequence denoted 1/2 Hindi. The Mbol site of the insert, 
which is joined to the BamHI site of the vector, is underlined and labeled. The prevalent telomeric repeat 5 - 
[CCCTAAA]-3' is displayed in heptameric blocks. The two variant repeats, 5 -[CTCTAAA]-3 , are underlined. 

so Figure 5 shows a Southern blot representation of the pAtT4 insert hybridization to Bal31 -treated Z. 

mays DNA. 

Figure 5(A) shows Z. mays genomic DNA treated with Bal31 [0.5 U/ml] for 0 (lane 1), 15 (lane 2), 35 
(lane 3), 60 (lane 4), or 90 minutes (lane 5), and subsequently digested with Bc1i. The DNA was size- 
fractionated by electrophoresis through a 0.7% agarose gel and transferred to a nylon membrane. The 
55 membrane was then probed with radiolabeled pAtT4 and washed at high stringency. 
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Figure 5(B) shows 2. mays genomic DNA treated with Ba131 [0.25 U/ml] for 0 (lane 1). 5 (lane 2), 15 
(lane 3), 30 (lane 4), and 50 minutes (lane 5), and subsequently digested with Taql. The DNA was size- 
fractionated by electrophoresis through a 0.8% agarose gel and transferred to a nylon membrane. The 
membrane was then probed with radiolabeled pAtT4 and washed at high stringency. 
5 Figure 6 shows a Southern blot representation of the pAtT4 hybridization to Bal31 -treated human 

DNA. 

Figure 6(A) shows human genomic DNA [1mg/mi] treated with Bal31 [1 U/ml] for 0 (lane 1), 5 (lane 
2), 15 (lane 3), 30 (lane 4), and 60 minutes (lane 5), and subsequently digested with Hindlll. The DNA was 
sized-fractionated by electrophoresis through a 0.7% agarose gel and transferred to a nylon membrane. 
w The membrane was then probed with radiolabeled pAtT4 and washed at low stringency. 

Figure 6(B) shows the membrane in 5(A) rehybridized with a radiolabeled Xenopus rDNA clone 
(X1 .4DNA), and washed at low stringency. 

Figure 7 schematically shows a procedure for making minichromosomes. 

Figure 8 schematically shows a shotgun cloning protocol for the functional selection of a plant ARS 
75 sequence. 

Figure 9 schematically shows a cloning protocol for cloning plant centromeres using telomeric probes 
and telotrisomic mutants. 

Figure 10 shows schemes for constructing plant artificial chromosomes in the format E. coli/plant 
circular shuttle vectors (A); yeast/plant circular shuttle vectors (B); and yeast/plant linear shuttle vectors (C). 
20 " ; 

Description of the Preferred Embodiments 



25 

I. DEFINITIONS AND BACKGROUND 

In the following description, reference will be made to various methodologies well known to those of skill 
in the art of recombinant genetics. Publications and other materials setting forth such well known 
30 methodologies will be referred to in the course of this description, and are incorporated herein by reference. 
The present invention relates to the construction of artificial chromosome vectors for the genetic 
transformation of plant cells, processes for their preparation, uses of the vectors, and systems transformed 
by them. Standard reference works setting forth the general principles of recombinant DNA technology 
include Lewin, B.M., Genes II, John Wiley & Sons, Publishers, New York (1985). Other works describe 
35 methods and products of genetic engineering; see, e.g., Maniatis et at., Molecular Cloning, A Laboratory 
Manual (Cold Spring Harbor Laboratory, 1982); Watson et al., Recombinant DNA: A Short Course (W.H. 
Freeman and Co., 1983); Setlow et aL, Genetic Engineering: Principles and Methods (Plenum Press, 1979); 
and Dillon et al., Recombinant DNA Methodology (John Wiley & Sons, 1985). 

By "transformation" or "transfection" is meant the acquisition in cells of new genetic markers through 
40 incorporation of added DNA. This is is the process by which naked DNA is introduced into a cell, resulting 
in a heritable change. 

As used herein, "eukaryote" refers to living organisms whose cells contain nuclei. A eukaryote may be 
distinguished from a "prokaryote" which is an organism which lacks nuclei. Prokaryotes and eukaryotes 
differ fundamentally in the way their genetic information is organized, as well as their patterns of RNA and 
45 protein synthesis. 

By the term "tower eukaryote" is meant a eukaryote characterized by a comparatively simple 
physiology and composition, and unicellularity. Examples of lower eukaryotes include flagellates, ciliates, 
and yeast, for example. 

By contrast, the term "higher eukaryote" means a multicellular eukaryote, characterized by its greater 
so complex physiological mechanisms as well as its ability to interact with its environment in a more 
sophisticated manner. Generally, more complex organisms such as plants and animals are included in this 
category. Preferred higher eukaryotes to be transformed by the present invention include, for example, 
monocot and diocot angiosperm species, gymnosperm species, fern species, plant tissue culture cells of 
these species, and algal cells. It will of course be understood that prokaryotes and eukaryotes alike may be 
55 transformed by the methods of this invention. 

As used herein, the term "plant" includes plant cells, plant protoplasts, plant calli, and plant cells or 
tissue culture from which whole plants can be regenerated. 

A variety of processes are known which may be utilized to effect transformation; i.e., the inserting of a 
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heterologous gene into a host cell, whereby the host becomes capable of efficient expression of the 
inserted genes. By "gene" is meant a DNA sequence that contains information for construction of a 
polypeptide or protein, and includes 5 and 3 ends. 

As used herein, "heterologous gene" is a structural gene that is foreign, Le., originating from a donor 

s different from the host or a chemically synthesized gene, and can include a donor of a different species 
from the host The heterologous gene codes for a polypeptide ordinarily not produced by the organism 
susceptible to transformation by the expression vehicle. Another type of "heterologous gene" is an altered 
gene from the host itself. One example of such an altered gene useful in the present invention is a mutant 
gene which encodes a herbicide-resistant form of a normally occurring enzyme. 

w By "host" is meant any organism that is the recipient of a replicable plasmid, or expression vector. 
Ideally, host strains used for cloning experiments should be free of any restriction enzyme activity that 
might degrade foreign DNA. For example, it is advantageous that a preferred E. coli host strain be defective 
in homology-based recombination (recA) since inadvertent recombinational events may threaten the 
integrity of the cloned DNA. The nucleic acid of the host should also be easily separable from that of the 

T5 cloning vector by physical means (e.g., differential centrifugation based on G-C content or encapsulation) or 
differ in its reactivity to alkali or organic acids (phenol). Preferred examples of host cells for cloning, useful 
in the present invention, are bacteria such as Escherichia coli, Bacillus subtilis, Pseudomonas, Strep- 
tomyces, Salmonella, and yeast such as S. cerevisiae. 

By "expression" is meant the process by which a structural gene produces a polypeptide. It involves 

20 transcription of the gene into messenger RNA (mRNA) and the translation of such mRNA into poiypeptide- 
(s). 

A method useful for the molecular cloning of DNA sequences includes in vitro joining of DNA segments, 
fragmented from a source high molecular weight oligonucleotide, to vector DNA molecules capable of 
independent replication. The cloning vector may include plasmid DNA (see Cohen, et al., Proc, Natl. Acad. 

25 Sci. USA 70:3240 (1973)), phage cDNA (see Thomas, et al., Proc. Natl. Acad. Sci. USA 71:4579 (1974)), or 
SV40 DNA (see Nussbaum, et al., Proc. Natl. Acad. Sci. USA 73:1068 (1976)). 

The term "plasmid" or "cloning vector" as used herein refers to a closed covalently circular extrach- 
romosomal DNA which is able to autonomously replicate in a host cell and which is normally nonessential 
to the survival of the cell. A wide variety of plasmids are known and commonly used in the art (see, for 

30 example, Cohen, S. et al., U.S. patent 4.468,464, which discloses examples of DNA plasmids). In a 
preferred embodiment, the plasmids used in the present invention are identical except for the orientation of 
the polylinker (see Levinson, et ai. J. Mol. AppL Genet. 2:507-517 (1984)). In a preferred embodiment, the 
plasmids used in the present invention also contain an M13 origin which allows single-stranded versions of 
the plasmids to be generated. 

35 The single-stranded, filamentous bacteriophage M13 is a well-known cloning vector (see. for example, 
Messing, et al., Proc. Natl. Acad. Sci. USA 74:3642-3646 (1977); Van Wegenbeek et ai., Gene 11:129-148 
(1980); and Gillam et al., Gene 12:129-137 (1980)). Preferably, the small plasmids pSDC!2 and pSDC13. 
with cloning polylinker, may be used. 

By "polylinker" it is meant a DNA molecule, generally 50 to 60 nucleotides long and synthesized 

40 chemically. In a preferred embodiment, this fragment contains one, or preferably more than one, restriction 
enzyme site for a blunt-cutting enzyme and a staggered-cutting enzyme, preferably BamHl. One end of the 
polylinker fragment is adapted to be ligatable to one end of the linear molecule and the other end is 
adapted to be ligatable to the other end of the linear molecule. 

A fragment of DNA, from any source whatsoever, may be purified and inserted into the plasmid at any 

45 of the polylinker region restric tion endonuclease cleavage sites. DNA to be inserted into a plasmid may be 
from prokaryotes, such as Escherichia coli, Bacillus subtilis, Streptomyces lividans or the like; from viruses 
which infect prokaryotes, such as lambda, T*, C31 or the like; from eukaryotes, such as Saccharomyces 
cerevisiae, Mus muscuius, Homo sapiens, or the like; or from viruses which infect eukaryotes such as 
Herpes Simplex, SV40. or the like. 

so The techniques and procedures required to accomplish insertion are wefl-known to the art (see Maniatis 
et al., Molecular Cloning; A Laboratory Manual, Cold Spring Harbor Laboratory, 1982). Typically, this is 
accomplished by incubating the plasmid in the presence of a restriction endonuclease such that the 
restriction endonuclease cleaves the plasmid to produce a linear molecule. 

Endonucleases preferentially break the internal phosphodiester bonds of polynucleotide chains. They 

55 may be relatively unspecific, cutting polynucleotide bonds regardless of the surrounding nucleotide 
sequence. However, the endonucleases which cleave only a specific nucleotide sequence are called 
restriction enzymes. Restriction endonucleases generally internally cleave foreign DNA molecules at 
specific recognition sites, making breaks within "recognition" sequences that exhibit two-fold symmetry 
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around a given point Because the same sequence (running in opposite directions) is found on both strands, 
such enzymes always create double-stranded breaks. 

Most often, the enzymes make a staggered cleavage, yielding DNA fragments with protruding single- 
stranded 5' termini. Such ends are said to be "sticky" or "cohesive" because they will hydrogen bond to 
s complementary 3' ends. As a result, the end of any DNA fragment produced by an enzyme, such as EcoRI, 
can annea! with any other fragment produced by that enzyme. This properly allows splicing of foreign 
genes into plasmids, for example. For purposes of the present invention, preferred restriction en- 
donucleases include Hindi, Mbol, and Bam HI. 

Some of these endonucleases create fragments that have blunt ends, that is, that lack any protruding 
10 single strands. When DNA has been cleaved with restriction enzymes that cut across both strands at the 
same position, blunt end ligation can also be used to join the fragments directly together. 

The great advantage of this technique is that any pair of ends may be joined together, irrespective of 
sequence. 

Those nucleases that preferentially break off terminal nucleotides are preferred to exonucleases. For 

75 example, small deletions can be produced in a circular DNA molecule by cleaving it with a restriction 
enzyme that linearizes it. The linear molecules are then treated with an exonuclease which starts from each 
3' end of the DNA and chews away single strands in a 3' to 5' direction, creating a population of DNA 
molecules with single-stranded fragments at each end, some containing terminal nucleotides. The nuclease 
Bal3i has often been used because this enzyme chews away both strands from the ends of linear DNA 

20 molecules. Exonucleases to be used in the present invention preferably include Bal31, SI, or ExollL 

Once the source DNA sequences have been cleaved and the plasmid linearized, they are incubated 
together and enzymes capable of mediating the ligation of the two DNA molecules are added. Preferred 
enzymes include T4 ligase, E. coli ligase, or other similar enzymes. The action of these enzymes results in 
the sealing of the linear DNA to produce a closed covalently circular DNA molecule containing the desired 

25 fragment (see, for example, U.S. Patents 4,237,224; 4,264,731; 4,273,875; 4,322,499 and 4,336,336). These 
nucleolytic reactions can be controlled by varying the time of incubation, the temperature, and the enzyme 
concentration needed to make deletions. 

It is to be understood that the termini of the linearized plasmid and the termini of the DNA fragment 
being inserted must be complementary in order for the ligation reaction to be successful. Suitable 

30 complementarity can be achieved by choosing appropriate restriction endonucleases (i.e., if the fragment is 
produced by the same restriction endonuclease so that used to linearize the plasmid, then the termini of 
both molecules will be complementary). As discussed previously, In a preferred embodiment, at least two 
classes of the plasmids used in the present invention are adapted to receive the foreign oligonucleotide 
fragments in only one orientation. 

35 After joining the DNA segment to the plasmid, the resulting hybrid DNA (known as a chimera) can then 
be selected from among the large population of clones or libraries. 

As used herein, a "library" is a pool of random DNA fragments which are cloned. In principle, any gene 
can be isolated by screening the library with a specific hybridization probe (see, for example, Young et a!., 
In: Eukaryotic Genetic Systems, ICN-UCLA Symposia on Molecular and Cellular Biology, VII (New York, 

40 Academic Press), pp. 315-331 (1977)). Each gene library may contain the DNA of a given organism inserted 
as discrete restriction enzyme-generated fragments into many thousands of plasmid vectors. For purposes 
of the present invention, E. coli, yeast, and Salmonella plasmids are particularly useful when the genome 
inserts come from other organisms. 

The feasibility of constructing and screening libraries from the DNA of organisms with small genomes, 

45 such as Drosphila or yeast has been demonstrated (see, for example, Weinsink et al., Cell 3:315-325 
(1974)) and Carbon et al.. In: Recombinant Molecules: Impact on Science and Society (Raven Press), pp. 
335-378 (1977)). Handling libraries from larger genomes, however, is difficult because of the numerous 
technical limitations inherent in manipulating such a complex system. 

By "hybridization" is meant the pairing of complementary RNA and DNA strands to produce an RNA- 

so DNA hybrid, or the partial pairing of DNA single strands from genetically different sources. 

As used herein, a "probe" is any biochemical (usually tagged in some way for ease of identification), 
used to identify or isolate a gene, a gene product, or a protein. 

A "selectable marker" is a gene with a known location on a chromosome and a clear-cut phenotype, 
permitting the identification of the existence of a particular sequence in which the marker has been 

55 attached. Use of selectable markers is described, for example, in Broach et al., Gene 8:121-133 (1979). 
Examples of selectable markers include the thymidine kinase gene, the cellular adenine-phosphoribosyf- 
transferase gene and the dihydrylfolate reductase gene, and preferably hygromycin phosphotransferase 
genes and neomycin phosphotransferase genes, among others. Preferred selectable markers in the present 
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invention include genes whose expression confer an antibiotic resistance to the host cell, sufficient to 
enable the maintenance of a piasmid within the host cell, and which facilitate the manipulation of the 
plasmid into new host cells. Of particular interest in the present invention are proteins conferring cellular 
resistance to ampicillin, chloramphenicol, tetracyline, and G-418, for example. 

5 

II. TELOMERE 

Based upon nuclease cleavage and ligation methods, utilization of plasmids with polylinkers oriented in 
w opposite orientation, and the use of an M13 helper phage, the telomeres of higher eukaryotes, preferably of 
a plant, may be isolated. 

In a preferred embodiment, unrestricted, high molecular weight nuclear DNA from any source whatso- 
ever may be treated briefly with an exonuclease to create blunt ends. (Source DNA may' be from 
prokaryotes, eukaryotes, or viruses which infect prokaryotes or eukaryotes, as previously detailed.) The 
rs blunt ends of the fragmented DNA are then ligated to a small E. coli plasmid cloning vector, for example, 
which has been previously digested with two different restriction endonucieases to generate one blunt end 
and on§ sticky end. The ligation products are then digested with the restriction endonucieases, leaving 
compatible ends. Next, the restriction fragments are ligated to form circles which are then propagated in an 
E. coli host, for example. 

20 Two independent primary libraries are constructed in this fashion: one using pSDCl2, for example, and 
the other using the complementary vector, pSDC13. 

The primary libraries will contain inserts corresponding to sequences that lie adjacent to the ends of the 
large genomic DNA fragments. Since chromosome size DNA cannot be isolated, only a small fraction of the 
inserts are derived from telomeric ends; the remainder will represent sequences that lie next to random, 

25 shear-generated ends. Thus, the ratio of telomeric to random inserts is a function of chromosome size and 
the average size of the isolated genomic DNA fragments. According to the present invention, the A. thaliana 
telomeric clone can be obtained at a frequency of 10~ 3 in each of the primary libraries. Each library should 
be composed of about 5000 members in order to ensure that each primary library has at least one 
telomeric clone. 

30 To increase the representation of telomeric clones, plasmids containing inserts that are present in both 
primary libraries must be identified and saved, and random clones, isolated from broken ends, discarded, to 
form a new collection of clones referred to as the secondary library. This procedure will save telomeric 
clones in which the primary libraries were large enough to ensure that each contained at least one telomeric 
clone. 

35 Although methods for the enrichment of desired DNA sequences prior to cloning are known (see, for 
example, Tilghman et al., Proc. Nafl Acad Sci. USA 74:4406-4410 (1977)), the novel enrichment protocol of 
the present invention capitalizes on the underrepresentation of nontelomeric sequences in the two primary 
libraries. Random, nontelomeric single copy sequences will not be represented in both primary libraries due 
to the small insert size (average = 0.3kb) and the small member size (about 5,000/primary library) of the 

40 primary libraries. Each primary library can represent, at most, only 15,000kb (5,000 X 0.3kb) of a 70,000kb 
haploid genome. Because the probability of cloning any particular DNA sequence independently from 
random chromosomal ends in both primary libraries is quite low, nontelomeric sequences will be under- 
represented and enrichment for telomeric clones can thus be achieved by saving clones with inserts 
common to both primary libraries (See Table 1). 

45 To assemble a secondary library, single-stranded DNA plasmids are first made by infection of both 
primary libraries with an M13 helper phage, for example. By utilizing plasmids with polylinkers oriented in 
opposite directions, infection with the M13 bacteriophage can separate the double-stranded DNA into two 
sets of complementary single-stranded molecules (see, for example, Levinson, et al., J. Moi. AppL Genet 
2:507-517 (1984)). The single-stranded DNAs are mixed and allowed to anneal. Since the M13 system 

so packages only one strand of the plasmid, the vector sequences can not hybridize to themselves. A plasmid 
can only cross-hybridize to another single-stranded plasmid if the inserts of these plasmids are homologous 
and represent complementary strands. 

The reassociation of two complementary sequences of DNA occurs by base pairing, Renaturation of 
DNA depends on random collision of the complementary strains; it therefore follows second order kinetics. 

55 This means that the rate of reaction is governed by the concentration of DNA that is single-stranded at any 
particular time, and the time of incubation. In a preferred embodiment, the concentration of such single- 
stranded plasmids will be 400 mg/ml. 

The single-stranded plasmids with complementary DNA inserts cross-hybridize to form a double- 



9 



EP 0 338 266 A2 



stranded DNA section and the single-stranded plasmids without such cross-hybridizing inserts remain as 
singie-stranded DNA molecules. Hydroxylapatite chromatography, which distinguishes between single- and 
double-stranded DNAs, can be used to purify cross-hybridizing single-stranded plasmids (see, for example, 
Britten, et al., Methods Enzymol. 29:363 (1974)). After isolation, the cross-hybridizing plasmids may be 

5 transformed into a recombination-deficient E. coli host, for example. 

The secondary library is thereafter screened in order to identify the clones that contain the specific 
hybrid DNA, or telomeric insert, desired. The screening of restriction fragments may be accomplished by a 
variety of methods; for example, gel electrophoresis, Southern blotting, buoyant density, etc. In a preferred 
embodiment, this screening for telomeric sequences may be correlated to exonuclease sensitivity. 

ro Telomeric DNA sequences are preferentially sensitive to exonuclease attack since they always reside next 
to a DNA end. (See, for example, Yao, et al., Proc. Natl. Acad. Sci. USA 78:7436-7439 (1981)). Non- 
telomeric sequences are degraded by exonuclease treatment of high molecular weight genomic DNA only 
when an infrequent random breakage occurs nearby. 

In a preferred embodiment, the screen may be conducted using Southern blots (see Southern, J. Mol. 

rs Biol. 98:503 (1975)) prepared as follows. Source nuclear DNA can be digested with an exonuclease for 
various lengths of time (see Figure 2) and subsequently digested with a restriction endonuclease, size- 
fractionated by agarose gel electrophoresis and transferred to nylon membranes. These Southern blots may 
then be probed with plasmid DNAs (pooled five at a time) prepared from clones in the secondary library. 
Plasmids which hybridize to restriction fragments which shift electrophoretic mobility upon increasing 

20 exonucleolytic digestion are telomeric clones. 

Using this cloning and enrichment method, a telomeric clone which hybridized to exonuclease-sensitive 
restriction fragments was isolated from A. thaliana. This plasmid, designated pAtT4, was discovered to be 
described by the DNA sequence 5-[CCCTAAA]-3' (see Example 5). A variant repeat was also found 
consisting of tandemly repeated blocks of the base sequence 5 -[CTCTAAA]-3 . The pAtT4 plasmid was 

25 deposited under the terms of the Budapest Treaty at the American Type Culture Collection, Rockville, 
Maryland, before the filing date of this application, and assigned Accession Number ATCC 67577. It is 
described more fully in Figure 4. 



30 III. AUTONOMOUS REPLICATING SEQUENCE 

Prior attempts to isolate functional higher eukaryotic replication origins have been largely unsuccessful. 
Some have attempted to isolate these ARSs by selecting for ARS function in yeast (see, for example, Ross 
et al., Mol. Cell. Biol. 3:1898-1908 (1983); Stinchcomb et aL. Proc. Natl. Acad. Sci. USA 77:4559-4563 

35 (1980)). Unfortunately, these "ARS" sequences do not function to support extrachromosomal replication 
when introduced back into the natural host organism (see Jongsma et al„ Plant Mol. Biol. 8:383-394 (1987)). 
The inventors have discovered that the best approach for isolating plant replication origins relies on 
functional tests for extrachromosomal replication in plant cells. 

The present invention discloses a novel method for isolating the ARS sequences which involves the 

40 formation of minichromosome derivatives of natural chromosomes (see Figure 7). It has been demonstrated 
in yeast that inverted repeats of telomeric sequences are "resolved" by an unknown mechanism which 
results in a double-stranded cleavage between the inverted repeats (see, for example, Szostak, Cold Spring 
Harbor Symp. Quant. Biol. 47:1187-1193 (1983)). After an inverted telomere repeat is introduced into a 
chromosome, a resolution reaction will lead to scission of the chromosome and formation of two 

45 chromosomal fragments, each with two telomeres (see Murray et al., Cell 45:529-536 (1986)). This process 
generates a minichromosome small enough to be isolated intact (<1Mb) allowing further manipulation by in 
vitro techniques to delimit the sequences responsible for autonomous replication. Example 11 details 
illustrative experimental procedures to be followed. 

A second novel method disclosed by the present invention for isolating functional plant origins is a 

so shotgun cloning approach (see Figure 8). Higher eukaryotic organisms have many replication origins 
distributed throughout their genomes. The A. thaliana genome contains approximately 1000 origins spaced 
every 70 kb along the chromosome (see Van't Hot et al., Chromosoma 68:269-285 (1978)). Therefore, it is 
quite reasonable to look for random fragments of genomic DNA from A. thaliana which promote extrach- 
romosomal replication. The details of performing such a shotgun cloning strategy are presented in Example 

55 12. 



IV. CENTROMERE 
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As previously discussed, a centromere is the primary constriction of the chromosome and the site of 
kinetochore assembly (see Blackburn et ai. t Ann. Rev, Biochem. 53:163-194 (1984); Clarke et ai. t Ann Rev. 
Genet 19:29-56 (1986)). The kinetochore is the physical structure that mediates the attachment of the 
spindle fibers to the chromosome and is therefore responsible for the proper partitioning of the chro- 
5 mosomes at mitosis and meiosis. The DNA sequences found at the centromere (CEN sequences) 
presumably play a role in specifying kinetochore assembly. 

CEN sequences, have been isolated from S. cerevisiae and Schizosaccharomyces pombe. The func- 
tional CEN sequences of S. cerevisiae are less than 200 bp long and consist of three conserved sequence 
elements (see Clarke, supra). The functional CEN sequences in S. pombe have not yet been defined 
70 precisely (see Nakaseko et ah, EMBO J. 5:1011-1021 (1986)). 

CEN sequences from a higher eukaryotic system have not been isolated. In contrast to S. cerevisiae 
yeast kinetochores, which cannot be seen under the microscope, kinetochores from higher eukaryotes are 
extremely large (up to 0.7 microns), indicating that higher eukaryotic CEN sequences constitute long 
stretches of DNA (see, e.g., Bloom et al., Cell 29:305-317 (1982); Ris et al., Chromosoma 82:153-170 
75 (1981); Sakai-Weda, A., Cytologia 48:253-258 (1983)). 

The cloning strategy for isolation of plant CEN sequences used in the present invention capitalizes on 
the availability of cloned teiomeric sequences and telotrisomic mutants of A. thaliana that possess a 
telomere in a new chromosomal location (see Figure 9). 

Telotrisomics are a special type of trisomic mutant that contain an extra chromosome arm in the form of 
20 a telocentric chromosome. Koornneef et al., Genetica 61:41-46 (1983) isolated Tr5A t the telotrisomic mutant 
used here, as a derivative of a primary trisomic line which contained an extra intact chromosome number 5. 
Telotrisomics arise from primary trisomic lines as a result of breakage of the extra chromosome when it 
exists as an unpaired univalent during meiosis; the breakage is though to occur at, or close to, the 
centromere (Darlington, J. Genet. 37: 341 (1939)). The telocentric is formed by a "healing" event which 
25 leads to addition of a new telomere at the site of breakage. 

A telotrisomic mutant has two extra telomeres in each nucleus: 1) the telomere on the distal tip of the 
extra chromosome arm which is identical to the telomere on the corresponding full length chromosome and 
2) a new telomere adjacent to the centromere which has no counterpart in the normal chromosome 
complement. 

30 The new telomere in the telotrisomic mutant Tr5A may be detected as a restriction fragment length 
polymorphism (RFLP) on Southern blots comparing telotrisomic genomic DNA and wild type disomic 
genomic DNA which have been probed with the cloned A. thaliana telomere (see Example 7). 

In a preferred embodiment, once a telotrisomic RFLP has been identified, a genomic library may be 
constructed to clone the restriction fragment representing the CEN-telomere fusion (see Example 8). 

35 Several tests can be used to determine if the clones isolated by this procedure represent A. thaliana 
CEN sequences. First, the putative CEN sequences are used as hybridization probes on Southern blots of 
all known telotrisomic mutants. If the probe represents authentic CEN sequences, it recognizes different 
RFLPs in each telotrisomic mutant, corresponding to the different CEN-telomere fusion events. 

Functional assays for CEN sequences may also be used. The development of acentric autonomously 

40 replicating plant vectors, previously discussed, effectuates assays of potential CEN sequences for partition- 
ing function. The acentric vectors are lost at a high frequency in the absence of selection due to the 
nondisjunction of replicating molecules. Addition of a functional centromere causes proper partitioning of the 
replicated molecules to daughter cells, leading to stable maintenance of the extrachromosomal vectors (see 
Hsiao et al., Proc. Natl. Acad. Sci. USA 78:3760-3764 (1981)). 

45 Another functional test for CEN activity which may be used in the present invention is the creation of 
dicentric chromosomes (see Mann et al., Proc. Natl. Acad. Sci. USA 80:228-232 (1983)). Dicentric 
chromosomes are broken during anaphase when the two centromere/kinetochores become attached via 
spindle fibers to opposite poles. The breakage often leads to loss or rearrangement of the effected 
chromosome. To perform a dicentric assay, putative CEN sequences can be integrated into plant 

so chromosomes using the Agrobacterium/T-DNA system. The introduced sequences which function as a 
centromere cause a dicentric chromosome to be created. This chromosome undergoes breakage, followed 
by rearrangement or chromosome loss; these events can be detected by karyotype analysis (i.e., 
microscopic examination of the chromosomes) and/or the loss of marker genes introduced along with the 
CEN sequences. 

55 A different experimental approach to assaying putative CEN sequences involves characterization of 
proteins which bind to these sequences. Such proteins can be purified by affinity chromotography using the 
putative CEN sequences as a binding substrate (see Kadonaga et al., Proc. Natl. Acad. Sci. USA 83:5889- 
5893 (1986)). Antibodies raised against the proteins are then used to localize the antigen on the 
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chromosome using in situ techniques. Authentic CEN binding proteins are thus localized at the centromere. 



V. CONSTRUCTION OF CHROMOSOMES 

5 

Having now disclosed novel recombinant molecules and methods for preparing them so as to obtain 
functional essential chromosomal elements, it will be possible for those of ordinary skill in the art to 
construct such artificial chromosomes. Useful construction methods are well-known (see, for example, 
Maniatis, T., et al.. Molecular Cloning: A Laboratory Manual (Cold Spring Harbor, 1982)). In addition to the 
70 essential elements, a selectable plant marker (e.g., antibiotic resistance gene), and a cloning site for 
insertion of foreign DNA must be included. In order to propagate the vectors in E. coli, it is necessary to 
convert the linear molecule into a circle by addition of a stuffer fragment between the telomeres, as 
indicated in Figures 8 and 10. An E. coli plasmid replication origin and selectable marker must also be 
included. 

75 Artificial plant chromosomes which replicate in yeast may also be constructed to take advantage of the 
large insert capacity and stability of repetitive DNA inserts afforded by this system (see Burke et al., 
Science 236:806-812 (1987)). In this case, yeast ARS and CEN sequences are added to the vector. The 
artificial chromosome is maintained in yeast as a circular molecule using the stuffer fragment to separate 
the telomeres. 

20 

VI. PLANT CELL TRANSFORMATION 

Deproteinized DNA may be introduced into plant cells by the following several methods known in the 
25 art. For example, Shillito et al., Biotechnology 3:1099-1103 (1985) detail a protocol for the stable transforma- 
tion of plant protoplasts at frequencies of up to 2% using electroporation. 

Other methodologies are also useful in the present invention. Microinjection techniques are disclosed in 
Crossway et al., Mol. Gen. Genet. 202:179-185 (1986). Microprojectile delivery may also be used (see Klein 
et al., Nature 327:70-73 (1987)). • 



EXAMPLES 



35 Having now fully described the present invention, the same will be more clearly understood by 
reference to certain specific examples which are included herewith for purposes of illustration only, and not 
intended to be limiting of the invention, unless specified. 

The following experimental parameters were utilized in all manipulations. 

\ 

40 

Plasmids and Probes 

Poly d(CA/GT) was purchased from Boehringer Mannheim Biochemicals. pSDC12 and pSDC13 are 
described in Levinson et al. t J. Mol. Appl. Genet. 2:507-517 (1984). pKDR1 and pKDR2 contain the soybean 
45 rRNA genes on 3.75 and 3.9kb EcoRI fragments respectively (Echenrode et al., J. Mol. Evol. 21:259-269 

(1985) ). pARR16 and pARRl2 contain 2.5kb and 180bp A. thaliana EcoRI fragments consisting of rDNA and 
rDNA spacer DNA cloned in pUCl2 respectively; pARR20-1 contains a 180bp Hindlll fragment correspond- 
ing to the 180bp tandem repeat family described by Martinez-Zapater et al., Mol. Gen. Genet. 204:417-423 

(1986) , cloned in pUC12; and X1.4DNA is a Xenopus rDNA clone (Maden et al., Nucl. Acid Res. 6:817-830 
so (1979)). All DNA was prepared by standard procedures (Ausubel et al., Current Protocols in Molecular 

Biology (New York: John Wiley & Sons) (1987)). Radiolabeled probes were prepared either by nick 
translation or the random priming method using kits purchased from Boehringer Mannheim Biochemicals. 



55 Preparation of Genomic DNAs 

A. thaliana nuclear DNA (average size of approximately 100kb) was prepared as described by Hamilton 
et al., Ann. Biochem. 49:48 (1972). Genomic DNA from the A. thaliana Landsberg telotrisomic mutant TrSA 
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and its disomic siblings (Koornneef et al., Genetica 61:41 (1983)) was prepared by the miniprep procedure 
of Deilaporta et al., Plant Mol. Biol. Rptr. 1:19-21 (1983). Genomic DNA preparation from al! other plant 
species was performed as described in Ausubel et al., supra (1987). Human DNA was prepared by fysis of 
monolayers of HeLa cells with 100 mM Tris-HCI pH 8.0, 50 mM EDTA, 500 mM NaCI, 1% SDS, and 100 
5 mg/ml Proteinase K. After incubation at 50" C for 1 hour the iysates were phenol-extracted twice and 
chloroform-extracted once. The genomic DNA was then collected by ethanol precipitation. 



Source of Enzymes 

70 

All restriction enzymes were purchased from New England Biolabs, Boehringer Mannheim Biochemical, 
or International Biotechnologies, inc. Ba131 and T4 DNA Ligase were purchased from New England Bioiabs. 



75 Example 1 



Construction of Primary Library 

20 

High molecular weight genomic DNA was prepared and treated with Bal31 exonuclease to remove 
approximately 10 bps from every double-stranded end. The exonuclease digestion ensured that Bal3l- 
susceptible terminal structures at the telomere (such as a hairpin) would be removed and that a blunt end 

25 suitable for ligation would be created. 

The Bal31 -treated biunt ends of the genomic DNA were ligated independently to the plasmid cloning 
vectors pSDC12 and pSDCl3 that had earlier been digested with BamHI and Hindi (which cleave in the 
polylinker, leaving one blunt Hindi end and one sticky BamHI end) to generate linear vector DNAs with non- 
compatible ends. The ligation products were then digested with BamHI and Mbol to generate linear 

30 chimeric DNAs with compatible ends (Mbol, which recognizes the sequence 5 -GATC-3 , cleaves genomic 
DNA frequently (about every 300 bp) leaving BamHI compatible ends; Mbol + does not cleave within the 
vector sequences because pSDC12 and pSDC13 were purified from a dam host). The linear chimeric 
DNAs were circularized by ligation at low DNA concentration and transformed into an F recA E. coli host 
(JM109) to form one pSDC12 and one pSDC13 primary library. 

35 Specifically, 2 ug of genomic DNA were digested for 0.5 min (A. thaliana) or 2 min (S. cerevisiae) with 
0.25 U/mi Bal31 nuclease at a DNA concentration of 10 u,g/ml at 30 *C as described below. It was 
previously determined that under these conditions the rate of Bal3l exonucleolytic digestion was approxi- 
mately 15 bp/min/end. pSDC12 and pSDC13 were digested separately with Hindi followed by digestion with 
BamHI; the DNAs were precipitated with spermine and the 5 phosphates were removed with calf intestinal 

40 phosphatase. (The phosphatase step was omitted when constructing the yeast primary libraries, resulting in 
a high number of plasmids without inserts.) The linearized plasmid DNAs were ligated to the BaI31 -treated 
yeast or A. thaliana genomic DNAs with T4 DNA Ligase at 16* C using 20,000 U/ml of enzyme, a 10 fold 
molar excess of vector to genomic DNAs, and a DNA concentration of 130 ng/ml. The ligation products 
were digested with Mbol and BamHI and circularized with T4 DNA Ligase at 4 # C using 80 U/ml of enzyme 

45 at a DNA concentration of 1.3 ug/ml. Transformation of the circularized DNAs into competent JM109 cells 
(Stratagene) generated two primary libraries for both A. thaliana and yeast. 

Example 2 

so 



Assembly of Secondary Telomeric Libraries 



Colonies representing both primary libraries (yeast primary libraries contained 474 (pSDC12) and 541 
(pSDC13) colonies (see Table 1); A. thaliana primary libraries contained 5500 (pSDC12) and 5700 (pSDC13) 
colonies) were scraped from agar plates, grown en masse in liquid culture, and infected at a multiplicity of 
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infection of 10 with the M13 helper phage rvl (Levinson et al., J. Moi. App. Genet 2:507-517 (1984)). The 
infected cultures were diluted 100-200 fold, grown 10-14 hr at 37* C, and phage particles were purified from 
the supernatants as described in Levinson et aL, supra (1984). Phenol extraction and ethanoi precipitation of 
the purified phage yielded single-stranded DNA representing each primary library. 

5 For both A. thaliana and yeast the single-stranded DNAs from the two primary libraries were mixed in 
equimolar amounts and allowed to anneal in 25 mM K* phosphate buffer pH 6.8 at 42* C for 3 hr 
(approximately 10 X Cot^ for the A. thaliana primary libraries). Cross-hybridizing single-stranded DNA 
molecules were isolated as partially double-stranded DNAs using hydroxylapatite chromatography as 
follows: 0.2 g of hydroxylapatite (BioRad HTP) hydrated in 100 mM K* phosphate buffer, pH 6.8 t and 

iq heated to 100* C for 30 min (Martinson et aL, Anal. Biochem. 61:144-154 (1974)) was equilibrated in 25 mM 
K* phosphate buffer, pH 6.8, and poured into a water-jacketed column (0.7 cm inner diameter) that was 
maintained at 42* C. The DNA hybridization solutions were diluted into 0.5 ml of 25 mM K* phosphate 
buffer, pH 6.8, and applied to the column. Single-stranded DNA was eluted by washing the column with 8 
ml of 100 mM K* phosphate, pH 6.8, at 42 °C followed by 3 ml of the same buffer at 50 *C. Partially 

rs double-stranded DNAs were eluted from the column with 2-4 ml of 200 mM K* phosphate buffer, pH 6.8, at 
50 *C. The fractions containing the partially double-stranded plasmids were concentrated by sec-butanol 
extraction followed by a chloroform extraction: the phosphate buffer was exchanged with 10 mM Tris-HCI 
pH 8,0, 1 mM EDTA, 100 mM NaCI using Sephadex G-50 (Pharmacia) spin columns, and the DNA was 
ethanol-precipitated. Secondary libraries were formed by transforming HB101 with the purified cross- 

20 hybridizing single-stranded plasmids using the protocol developed by Michael Scott (personal communica- 
tion). 

The A. thaliana secondary library was screened to identify abundant non-telomeric repetitive DNA 
clones by colony hybridization (described below) using a mixed probe containing the inserts of pARR12, 
pARR20-1, pKDR1. and pKDR2. Colonies which did not hybridize to this mixed probe were saved in 

25 ordered arrays in 96 well microtiter dishes at -80* C in LB media containing 15% glycerol. 

For characterizations of the yeast primary and secondary libraries, colony hybridizations were per- 
formed using nitrocellulose filters according to the protocols of Grunstein et al., PNAS USA 72:3961-3965 
(1975) with the following exceptions: the hybridizations (using poly d(CA/GT) probe) were carried out in 1 M 
NaCI, 10% dextran sulfate, 1X P Buffer (New England Nuclear. Gene Screen protocols), and 25 ug/ml E. 

30 coli DNA at 60-65* C. The filters were washed in 2X SSC, 0.5% SDS at. 50-55 * C. The A. thaliana secondary 
library was gridded out on targe (22 X 22 cm) agar plates and transferred to uncharged nylon membranes 
(BioTrans, ICN). The colonies were lysed by autoclaving the membranes for 1 min, and the hybridizations 
(using pAtT4 insert probe) were carried out at 65* C using 1 M NaCI, 10% dextran sulfate, 1% SDS. 25 
ug/ml tRNA, and a probe concentration of 10 s cpm/ml. The filters were washed at 60 *C with 0.2X SSC, 

35 0.1% SDS. 



Example 3 

Screening of Secondary Telomeric Libraries 



45 To screen the A. thaliana secondary library, liquid cultures inoculated from the microtiter dishes were 
pooled in groups of 5-6 and the plasmid DNAs purified by the alkaline lysis miniprep procedure (Ausubel et 
al„ supra (1987)). The plasmid pools were radiolabeled by nick translation using 32 P-dCTP (New England 
Nuclear) and used to probe genomic Southern blots (Haelll digests) of A. thaliana nuclear DNA which had 
been previously digested with Bal31 nuclease for 0, 7, and 30 min under the following conditions: 

so High molecular weight A. thaliana nuclear DNA was digested with 0.25 U/ml Bal31 nuclease at 30* C at 
a DNA concentration of 10 ug/ml in 12 mM CaCI 2 , 24 mM MgCk, 0.2 M NaCI, 20 mM Tris-HCI pH 8.0, 1 
mM EDTA, and 100 ug/ml BSA. Modifications of these parameters used for Bal31 digestion of Z. mays and 
human genomic DNA are indicated in the appropriate figure legends. Reactions were stopped at the 
indicated times by addition of EGTA to a final concentration of 20 mM, extracted with phenol and 

55 chloroform, and precipitated with ethanoi. 

Southern blotting and hybridization were carried out with nylon filters (GeneScreen, NEN) using the UV 
cross-linking procedure of Church and Gilbert, PNAS USA 81:1991-1995 (1984)). Generally, hybridizations 
were done in 0.5 M NaHPO*. pH 7.2, 7% SDS, 1 mM EDTA, and 1% BSA (Church and Gilbert, supra 
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(1984)) at 50* using a probe concentration of 10 s cpm/ml. Filters were washed under the followjng 
conditions: Low stringency = 2X SSC, 1% SDS at 60° C; high stringency = 0.2X SSC, 0.1% SDS at 60° C. 

In this case, the blots were washed under in 2X SSC, 1% SDS at 55* C. Piasmid pools that hybridized 
to Bal31 nuclease-sensitive bands or bands that ran at the limit of mobility were split into individual clones 
5 to identify potential teiomeric clones. 



Example 4 

10 

Identification of pAtT4 Teiomeric Insert 



15 To identify A. thaliana teiomeric clones, the screening procedure described in Example 3 was used. 

One clone, designated pAtT4, was found which hybridized to exonuclease-sensitive restriction fragments. 

This piasmid was isolated as a stable derivative of a larger piasmid that suffered spontaneous deletions. 
Figure 2 shows a Southern blot of Ba!31 -treated A. thaiiana nuclear DNA which was probed with 

radiolabeled pAtT4. The lanes at the left in Fig. 2A contain Bal31 -digested nuclear DNA which was 
20 subsequently cut with Mbol. In the non-Bal31 -treated lane, pAtT4 hybridized to a smear representing 

restriction fragments ranging from 2.0 to 5.0kb in length. The size and intensity of the heterodisperse Mbol 

band decreased upon increasing exonuciease treatment. The rate of exonuclease digestion was linear and 

approximately 2.5kb of sequence had to be removed to eliminate most of the pAtT4 hybridization. 

The righthand lanes in Fig. 2A show the hybridization pattern of pAtT4 to BamHI-digested Bal31-treated 
25 genomic DNA. In this case, the hybridization signal was resolved into several higher molecular weight 

bands; these bands were still diffused, indicating that the probe hybridized to a collection of restriction 

fragments that were heterogeneous in size. The BamHI bands become smaller and eventually disappeared 

with increasing Bal31 digestion. 

To demonstrate that the exonuclease sensitivity of pAtT4 homologous sequences was not a property of 
30 genomic sequences in general, the fitter shown in Fig. 2A was reprobed with A. thaliana ribosomal DNA. 

The results (Fig. 2B) show that the rRNA gene probe hybridized to several bands, none of which shifted 

mobility due to exonuciease treatment. 

35 Example 5 



Sequencing of pAtT4 Teiomeric insert 

40 

The pAtT4 insert was sequenced by the dideoxy method using double-stranded template following the 
protocols described in Ausubel et aL, supra (1987). The simple sequence insert template caused pausing of 
the reverse transcriptase, making it difficult to read sequence far from the primer; for this reason, both 
45 strands of the insert were not sequenced in their entirety. The sequence of the teiomeric repeats was 
confirmed by sequencing pAtT4 using the chemical method of Maxam and Gilbert, Meth. Enzymol. 65:499- 
560 (1980). 

The DNA sequence of pAtT4 insert is shown in Fig. 4. The insert is composed almost entirely of 
tandemly repeated blocks of the se quence 5-[CCCTAAA]-3'. Two variant repeats, 5'-[CTCTAAA]-3\ were 

so found in tandem in the middle of the insert. The simple sequence repeated DNA abuts the blunt end 
cloning site of the vector (noted as 1/2 Hindi in Fig. 4); this end of the insert corresponds to the terminus of 
the chromosome. It should be kept in mind, however, that since the genomic DNA was initially treated with 
Bal31 to remove a small number of nucleotides from the ends, before the cloning was performed, any DNA 
structures (such as foldbacks, nicked strands) at the extreme terminus were not preserved in the pAtT4 

55 insert. 

From the teiomeric end of the insert, the tandemly repeated sequence extends 386 bp, comprising 55 
complete repeat units. The orientation of the repeats is such that the C-rich strand points towards the 
centromere when read in the 5 to 3' direction. In addition, the insert contains 17 nucleotides of a nonrepeat 
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flanking sequence which includes the Mbol site used to clone the terminal restriction fragment. 

The discrepancy between the size of the pAtT4 insert (403 bp) and the telomeric Mbol restriction 
fragments detected on Southern biots (2.5-5.0 kb) (see Fig. 2A) is most likely due to internal deletion of the 
telomeric repeats during propagation of the plasmid. Consistent with this interpretation is the fact that pAtT4 
5 was isolated as a stable derivative of a larger plasmid which suffered spontaneous deletions. Similar 
instability of simple sequence telomeric repeat clones from Trypanosomes has been noted (see Blackburn 
et aL, Cell 36:447-457 (1984)). 



10 Example 6 



Representation of pAtT4 Homologous Clones in the A. thaliana Primary and Secondary Libraries 

The representation of telomeric clones in the A. thaliana primary and secondary libraries was deter- 
mined by colony hybridization using the purified insert of pAtT4 as a probe. Approximately one out of 5000 
clones in each amplified primary library hybridized to the pAtT4 insert probe (frequency of 2 X 10~ 4 ). Since 
20 each primary library was composed of approximately 5500 members, this result suggests that each of the 
original primary libraries contained a single telomeric clones. The second library, on the other hand, 
contained five out of a total of 765 clones which hybridized to the pAtT4 insert probe (frequency of 6,5 X 
10~ 3 ). The assembly of the secondary library resulted in a roughly 30-fold enrichment for telomeric clones 
relative to the primary libraries. 

25 

Example 7 



30 

isolation of Telomeric DNA Sequences from Yeast Using the Enrichment Strategy 



To determine the effectiveness of the telomeric cloning and enrichment strategy outlined in Figure 1 , an 
35 experiment was undertaken to clone yeast telomeric sequences by this method. The results are presented 
in Table One. 

TABLE ONE 



Type of 


Vector 


Members 


% Inserts 


Members 


#CA/GT 


Freq 


Library 








w/lnserts 






Primary 


pSDC12 


474 


50% 


240 


4 


0.017 


Primary 


PSDC13 


541 


65% 


350 


6 


0.017 


Secondary "A" 




164 


100% 


164 


28 


0.17 


Secondary "B" 




134 


nd 


<134 


27 


0.20 



The type (primary versus secondary) and size (number of member clones) of the yeast libraries are 
indicated. Secondary W A" and "B" denote two independently assembled secondary libraries. The % insert 
was estimated from the study of 20 randomly selected clones (nd = not determined). # CA/GT refers to the 
number of clones which hybridized to a radiolabeled poly d(CA/GT) probe, as determined by the colony 
hybridization method. The frequency of poly d(CA/GT) hybridizing clones in each library (Freq) was 
calculated by dividing # CA/GT by the number of members with inserts. 

The pSDC12 and pSDC13 yeast primary libraries contained 240 and 350 clones with inserts, respec- 
tively. Approximately one out of sixty clones in both yeast primary libraries hybridized to a poly d(CA/GT) 
probe that cross-hybridized to the yeast telomeric sequence (Ci. 3 A) n , as well as short stretches of (CA) n 
found in the chromosome arms (Walmsiey et al., PNAS USA 82:506-510 (1983)). 

The following observations indicate that most of the poly d(CA/GT) hybridizing clones contained 
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telomeric sequence inserts: (1) 9 out of 10 clones had the insert orientation predicted for cloning of 
telomeric restriction fragments (with asymmetric C-rich and G-rich strands) into the pSDC12 and pSDC13 
complementary vectors, (2) sequencing of one pSDC13 clone showed that the (C^A),, sequence abuts the 
blunt end cloning site of the vector, (3) Southern blot experiments indicated that at least three clones 
s contained inserts which cross-hybridized with telomere flanking sequences (data not shown). 

Two separate secondary libraries (designated A and B) were assembled from the primary yeast libraries 
as described in Figure 1. As shown in Table One, the frequency of clones hybridizing with the poly d- 
(CA/GT) probe was approximately 0.2 in both secondary libraries, indicating that a ten-fold enrichment in 
clones that hybridized to the poly d(CA/GT) probe was achieved. 



Example 8 



T5 

Identification of a Telotrisomic RFLP by the Telomeric Clone pAtT4 



To determine if pAtT4 would identify a telotrisomic RFLP, Southern blots of genomic DNA were 

20 prepared from telotrisomic mutant Tr5A and wild type disomic plants. Genomic DNA from the A. thaliana 
Landsberg telotrisomic mutant and its disomic silbings was prepared by the miniprep procedure of 
Dellaporta et al. v Plant Mol. Biol. Rptr. 1:19-21 (1983). One feature of telotrisomic plants that is of particular 
use in this experiment is that both disomic and telotrisomic plants appear in the progeny after selfing 
telotrisomic parents. Therefore, the telotrisomic progeny should be isogenic with their disomic siblings 

25 except for the presence of the extra telocentric chromosome. Moreover, since the telocentric chromosome 
is free to recombine with the full length chromosomes, and the telotrisomic stocks are maintained by selfing 
mutant plants, the only qualitative difference between the genomes of the telotrisomic and the disomic 
progeny should be the new telomere. 

Telotrisomic and normal disomic plants are distinguished on the basis of gross morphology. Disomic 

30 plants are generally larger and have broad leaves while Tr5A telotrisomics are semi-dwarf plant with narrow 
and slightly serrated leaves. There is some overlap in the phenotypic norms; in particular, individual diploid 
plants that are growing slowly will occasionally be identified incorrectly as telotrisomic. It is unlikely, 
however, that telotrisomic progeny will be mistaken for disomies. 

As shown in Fig. 38, pAtT4 hybridized to a 15 kb Dral restriction fragment in the Tr5A genome that is 

as absent from the disomic genome. This novel restriction fragment corresponds to the functional telomere that 
was added next to the centromere. 

The experiment shown in Fig. 3B was repeated using DNA isolated from 14 individual progeny plants 
derived from selfing a single Tr5A parent. Of the 14 plants, 9 were identified as disomies; none of these 
plants contained the polymorphic 15kb Dral marker identified by pAtT4. This result indicates that the RFLP 

40 was linked to the new telomere since this telomere is the only genetic marker that can not be crossed away 
from the telocentric chromosome onto the full length chromosomes. The remaining 5 plants were identified 
phenotypically as telotrisomics; of these, 3 displayed the telotrisomic pattern defined in Fig. 3B, and 2 
displayed the disomic pattern. As expected, the RFLP was only found in plants phenotyped as telotrisom- 
ics. The "telotrisomic" plants which did not contain the marker were most likely incorrectly phenotyped 

45 disomic plants and not a consequence of recombination between the new telomere and the polymorphic 
marker since such a recombination even was not detected from study of the disomic siblings. 

The correlation between the presence of the new telomere and the pAtT4 15kb Dral polymorphic 
marker, in conjunction with the exonuclease sensitivity of the pAtT4 homologous genomic sequences, 
argues convincingly that pAtT4 hybridized to telomeric sequences in A. thaliana. 

so Another feature of the blot shown in Fig. 3B is of interest. The heterodisperse bands in the Tr5A lane 
are slightly smaller than the corresponding bands in the disomic lane, indicating that the telomeres in the 
mutant are smaller. This size difference is not surprising since telomeres in lower eukaryotic systems are 
known to be dynamic structures which change length depending on the genetic background and growth 
conditions of the organism. 

55 

Example 9 
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Isolation of Plant Centromeres Using Cloned Plant Telomeres 



Once a telotrisomic RFLP is identified, a genomic library may be constructed to clone the restriction 
5 fragment representing the CEN-telomere fusion. Genomic DNA from the telotrisomic mutant is treated 
briefly with the exonuclease Bal31 to generate blunt ends at the telomere, as was done in the telomere 
cloning strategy (see Figure 1). The end-repaired genomic DNA is then digested with the restriction 
endonuclease which defines the RFLP, and the population of DNA molecules corresponding to the size of 
RFLP isolated. The size-selected DNA is cloned into a circular episomal yeast vector that carries a 
io functional CEN and ARS. The yeast vector-host system Is preferably used because of its tolerance for long, 
repetitive DNA inserts (see Burke et al.. Science 236:806-812 (1987)). The desired CEN-telomere clone may 
be identified in the library by hybridization with the isolated A. thaliana telomere. Alternatively, an E. coli 
vector-host system can be used. 

75 

Example 10 



20 Homology Between A. thaliana Telomere Clone and Telomeric Sequences in More Complex Plants 



To determine whether the telomeric repeats from A. thaliana were homologous to telomeric DNA 
sequences in other higher plants, Southern blots of genomic DNA were prepared from a variety of plant 

25 species. Genomic DNA preparation from all plant species other than A. thaliana was performed as 
described in Ausubei et al. f In: Curr. Prot Molec. Bio!., pp. 2.3 (New York: John Wiley & Sons) (1987) and 
incorporated herein by reference. These DNA sequences were probed with radio-labeled pAtT4 and washed 
at high stringency (see Ausubei, supra). The probe hybridized to genomic sequences from all plants tested 
including Brassica campestris, Brassica carinata, Raphanus sativus (Radish), Lycopersicon esculentum 

30 (Tomato), Nicotiana tabacum (Tobacco), Medicago sativa (Alfalfa), and Zea mays (Corn). All of these plant 
species are dicots with the exception of the monocot Z. mays. 

A. thaliana is a dicot and Z. mays represents an evolutionarily diverged plant. Thus, Z. mays 
represented an excellent test vehicle for determining the applicability of the teachings of the present 
invention to higher plant species. 

35 To determine if the pAtT4 cross-hybridizing sequences in the Z. mays genome were located at the 
telomeres, high molecular weight Z. mays genomic DNA was treated with Bal31 for various lengths of time, 
digested with the restriction endonuclease Bell, size fractionated, and transferred to a nylon membrane. The 
filter was then probed with pAtT4 and the results are shown in Fig. 5A. 

The pAtT4 hybridization pattern includes several bands larger than 14kb and a collection of incom- 

40 pletely resolved heterodisperse bands of lower molecular weight. The bands become less intense and 
display increased electrophoretic mobility upon increasing Bal31 treatment. A faint 8kb band that did not 
change in size during exonuclease digestion can also be seen in Fig. 5A, demonstrating that sensitivity to 
Bal3l digestion is not a property of all genomic sequences. 

This experiment indicates that A. thaliana telomeric repeats cross-hybridize to telomeric sequences in 

45 Z. mays. 



Example 1 1 

50 

Cross-Hybridation of A. thaliana Telomere Clone to Telomeric Sequences in the Human Genome 



55 Southern blots of the genomic DNA prepared from some well-studied animal model systems were 
probed with pAtT4 to determine if the A. thaliana telomere repeat would cross-hybridize to animal 
telomeres. 

Human DNA was prepared by lysis of monolayers of HeLa cells with 100 mM Tris-HCl pH 8.0, 50 mM 
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EDTA, 500 mM NaCI, 1% SDS, and 100 ug/ml Proteinase K. After incubation for 1 hour, the lysates were 
phenol-extracted twice and chloroform-extracted once. The genomic DNA was then collected by ethanol 
precipitation. The A. thaliana telomere probe hybridized to both human and Caenorhabditis elegans DNA 
under low stringency conditions, but pAtT4 did not hybridize to the Drosophila melanogaster genome. 

To test if the pAtT4-cross-hybridizing sequences in the human genome were telomeric, pAtT4 was used 
to probe a Southern blot (Hindlll digest) of Bal31 -treated human DNA. Figure 6A shows that pAtT4 
hybridized to a heterodisperse band ranging from 8.5 to 15kb, in addition to two well-defined higher 
molecular weight bands of 22 and 45kb. The 22 and 45kb band shifted mobility and eventually disappeared 
with increasing exonuclease digestion. Two faint bands of 2.3 and 3.5kb are also seen in Figure 6A, which 
were not affected by the exonuclease treatment. 

As a control, the filter shown in Fig. 6A was stripped and reprobed with a Xenopus rDNA clone which 
cross-hybridized to human rRNA genes. Restriction fragments carrying the human rRNA genes do not 
change mobility during the exonuclease treatemtn, as shown in Fig. 6B. 

These data suggest that most of the pAtT4 cross-hybridizing sequences of human DNA are telomeric. 



Example 12 



First Method of Isolation of Autonomous Replicating Sequence 



An Inverted telomere repeat is integrated into an A. thaliana chromosome using the Agrobacterium/T- 
DNA transformation system. Genes for different drug resistances are placed on either side of the telomere 
palindrome. After integration of this construct into an A. thaliana chromosome, a resolution reaction 
generates two chromosome fragments, both of which can be selected for by requiring that transformants to 
be resistant to both drugs. Since the average A. thaliana chromosome is only 20Mb long, random 
integration of the telomere palindrome leads to the formation of an isolatabie (<1Mb) acentric minich- 
romosome in approximately 10% of the transformation events. The isolated minichromosome is further 
fragmented using in vitro cloning techniques and assayed for extrachromosomal replication in plant cells. 

In this way, the sequences comprising a functional plant replication origin may be defined. 



Example 13 



Second Method of Isolation of Autonomous Replicating Sequences 



At a first step, a plasmid cloning vector is constructed with the following features: i) a plasmid 
replication origin and a selectable marker that functions in E. coli, ii) a selectable marker that functions in 
plant cells, iii) a cloning site(s) for insertion of plant genomic DNA fragments, and iv) an A. thaliana telomere 
inverted repeat separated by a stuffer fragment flanked by Notl sites (the stuffer fragment permitted 
replication of the plasmid as a circle in E. coli). 

An A. thaliana genomic library is then constructed for this cloning vector and maintained in an E. coli 
host. Plasmid DNA prepared from this library is digested with the restriction endonuclease Notl to generate 
linear molecules with two telomeric ends. These linear DNA are then introduced into plant cells by direct 
DNA transformation (see Potrykus et al., Plant Mol. Biol. Rptr. 3:117-128 (1985)) and the transformations 
selected. 

Introduction of the DNA into plant cells as linear molecules with telomeric ends reduces the frequency 
of random integration into the chromosomes caused by recombinogenic, broken DNA ends formed during 
transfection (see. for example, Perrot et al., Mol. Cell Biol. 7:1725- 1730 (1987). Since extrachromosomally 
replicating DNA molecules without centromeres are mitoticaily unstable, screening for transformants which 
lose the marker under nonselective growth is required (see Hsiao et al., Proc. NatL Acad. Sci. USA 78:3760- 
3764 (1981)). Once unstable transformants are identified, extrachromosomal DNA is prepared to recover the 
linear minichromosomes. The ARS sequence responsible for extrachromosomal replication is then identified 
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and studied by mutational analysis. 



Example 14 

5 



Construction of Artificial Plant Chromosomes 

70 

Plant artificial chromosomes are constructed by combining the previously isolated essential 
chromosomal elements. Various configurations of assembled artificial chromosomes are described below for 
purposes of illustration (see Figure 10). These artificial chromosomes are designed to be "shuttle vectors"; 
they can be maintained in convenient host (either E. coli or yeast) as well as plant cells. 

15 An artificial chromosome can be maintained in E. coli as a circular molecule by placing a removable 
stuffer fragment between the telomeric sequence blocks (see Figure 10(A)). The stuffer fragment is a 
dispensible DNA sequence, bordered by unique restriction sites, which can be removed by restriction 
digestion of the circular DNAs to create linear molecules with telomeric ends. In addition to the stuffer 
fragment and the plant telomeres, the artificial chromosome contains a replication origin and selectable 

20 marker that can function in E. coli to allow the circular molecules to be maintained in E. coli cells. The 
artificial chromosomes also include a plant selectable marker, a plant centromere, and a plant ARS to allow 
replication and maintenance of the DNA molecules in plant cells. Finally, the artificial chromosome includes 
several unique restriction sites where additional DNA sequence inserts can be cloned. The most expeditious 
method of physically constructing such an artificial chromosome, i.e., ligating the various essential elements 

25 together for example, will be apparent to those of ordinary skill in this art. 

Figure 10(B) illustrates an artificial chromosome that can be maintained as a circular molecule in yeast 
cells. Again, a stuffer fragment is included to separate the telomeric sequence blocks and facilitate the 
replication of the molecule as a circle. The artificial chromosome also contains a plant selectable marker, a 
plant centromere, a plant ARS, and cloning sites. To allow maintenance of the DNA molecule in yeast, a 

30 yeast ARS, a yeast contromere, and yeast selectable marker are also included. 

Alternatively, a plant artificial chromosome is constructed that can be maintained as a linear molecule in 
yeast by utilizing the A. thaliana telomeres as shown in Figure 10(C). 

The instant disclosure sets forth all essential information in connection with the invention. It has been 
described in some detail by way of illustration and example for purposes of clarity and understanding. It will 

35 be obvious that certain changes and modifications may be practiced within the scope of the invention, as 
limited only by the scope of the appended claims. 



Claims 

40 

1. A recombinant DNA molecule characterized in that it contains the telomere of a higher eukaryote. 
preferably a plant, especially Arabidopsis thaliana . 

2. The recombinant DNA molecule of claim 1, characterized in that it additionally contains a centromere 
and/or an autonomous replicating sequence and/or a selectable marker gene. 

45 3. The recombinant DNA moiecule of claim 1, characterized in that it is capable of being maintained as 
a chromosome in a prokaryote, preferably a bacterium, or in a eukaryote, preferably a higher eukaryote, 
especially a plant or a plant tissue culture cell. 

4. The recombinant DNA molecule of claim 3, characterized in that it additionally contains a desired 
gene sequence, preferably a gene sequence selected from the group consisting of: 

so (1 ) a gene sequence of a hormone gene; 

(2) a gene sequence of an antibiotic resistance gene; 

(3) a gene sequence of a nitrogen fixation gene; 

(4) a gene sequence of a plant pathogen defense gene; 

(5) a gene sequence of a plant stress-induced gene; 
55 (6) a gene sequence of a toxin gene; and 

(7) a gene sequence of a seed storage gene. 
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5. The recombinant DNA molecule of claim 4, characterized in that it is capable of expressing said 
desired gene sequence in a prokaryote or in a eukaryote, preferably in a higher eukaryote, especially in a 
plant or a plant tissue culture cell. 

6. A method of producing a polypeptide in a recipient prokaryote cell, preferably a bacterium, or in a 
s eukaryote cell, preferably a higher eukaryote, especially a plant or a plant tissue culture cell, characterized 

by 

(a) introducing a gene sequence capable of producing said polypeptide into a recombinant molecule, 
said recombinant molecule containing a telomere of a higher eukaryote; 

(b) introducing said recombinant molecule containing said cloned gene sequence into a recipient cell; 

w and 

(c) incubating said cell under conditions sufficient to permit the production of said polypeptide. 

7. The recombinant DNA molecule of claim 1, characterized in that said telomere comprises tandemly 
repeated blocks of the base sequence s'-[CCCTAAA]-3' or variant repeats described by the base sequence 

75 5'-[CTCTAAA]-3'. 

8. The recombinant DNA molecule of claim 1 , wherein said telomere is the pAtT4 plasmid, Accession 
Number ATCC 67577. 

9. A host cell, preferably a eukaryotic cell, especially a higher eukaryotic ceil, particularly a plant cell, 
transformed with the recombinant DNA molecule of claim 1. 

20 10. A DNA molecule bearing the formula (CCCTAAA)n, where n — 1 and up to 1000. 



Claims for the following Contracting state: ES 

25 1. A method of producing a polypeptide in a recipient cell, characterized by 

(a) introducing a gene sequence capable of producing said polypeptide into a recombinant molecule, 
said recombinant molecule containing a telomere of a higher eukaryote; 

(b) introducing said recombinant molecule containing said cloned gene sequence into a recipient cell; 

and 

30 (c) incubating said cell under conditions sufficient to permit the production of said polypeptide. 

2. The method of claim 1 , characterized in that said recipient ceil is a prokaryote. 

3. The method of claim 2, characterized in that said prokaryote is a bacterium. 

4. The method of claim 1 , characterized in that said recipient cell is a higher eukaryote. 
35 5. The method of claim 4 t characterized in that said higher eukaryote is a plant. 

6. The method of claim 4. characterized in that said higher eukaryote is a plant tissue culture cell. 

7. The method of claim 1, characterized in that said telomere comprises tandemly repeated blocks of 
the base sequence 5'-[CCCTAAA]-3'. 

8. The method of claim 7, characterized in that said telomere comprises variant repeats described by 
40 the base sequence 5-[CTCTAAA]-3'. 

9. The method of claim 1, characterized in that said telomere is the pAtT4 plasmid, Accession Number 
ATCC 67577. 

10. The method of claim 1, characterized in that said recombinant molecule is capable of being 
maintained as a chromosome and additionally contains a desired gene sequence which is selected from the 

45 group consisting of: 

(1 ) a gene sequence of a hormone gene; 

(2) a gene sequence of an antibiotic resistance gene; 

(3) a gene sequence of a nitrogen fixation gene; 

(4) a gene sequence of a plant pathogen defense gene; 
so (5) a gene sequence of a plant stress-induced gene; 

(6) a gene sequence of a toxin gene; and 

(7) a gene sequence of a seed storage gene. 
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i genes ana replication origins" " 



Step Two: Construct a telomere inverted repeat, flanked by two 
different drug resistance markers in T-DNA vector 

telomere 
p inverted p 
drugA* repeat druqB* 

i^- 



Step Three: Intoduce the telomere inverted repeat construct into a 
plant chromosome 



-=iX5= 



chromosome with unresolved telomere 
palindrome 



Step Four: Screen for resolution of the telomere inverted repeat by 
Southern blotting experiments 



and 



small acentric chromosome 



Step Five: Isolate small acentric chromosome; subclone to delimit ARS 

FIG. 7 



FIG.8 



Functional selection for Plant ARSs: Shotgun cloning 



Step One Start with a cloning vector of this general design 

Notl Not) 



telomere 



stuffer 



telomere 



plant 
marker 



sites 



LcoJi 
origin 
ana, 
marker 



Step Two: 



Construct a plant genomic library in this vector: 
maintain in E. coli 



Step Three: Prepare plasmid DNA from this library; I inearize with Notl 




genomic plant 
ONA insert 
ARS??? 



Step Four: Direct ONA transformation into plant protoplasts; select 
transformants as callus 



Step Five: Screen for instability of plant marker under nonselective 
growth, evidence for extrachromosomal maintenance 



Step Six; Prepare extrachromosomal ONA and recover plant ARS 
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Cloning Plant Centromeres using Telomeric probes and Telotrisomic 
mutants 

Step One: Prepare DNA from telotrisomic mutant and normal disomic plants 



clff-TELO 



=0= 
CEN 



TELO 



telotrisomic 



wild type disomic 



Step Two: Do a genomic Southern blot, probe with radiolabeled telomere; 
Look for RFLPs 

Telotrisomic Disomic 



RFLP 
corresponding to 
the telomere 
centromere 

fusion 




Southern Blot probed 
with radiolabeled 
telomere 



Step Three: Purify end-repaired, restricted genomic telotrisomic DNA 
corresponding to CEN-TELO RFLP; construct a library in a 
yeast ARS CEN vector 



Step Four; Screen the library using the telomere probe to find clones 
containing the CEN-TELO fusion fragment 



Step Five; Test putative CEN clones: Southerns of all felofrisomics, 
Mitotic stabilization Dicentric assay, DNA binding 
protein analysis 



PLANT ARTIFICIAL CHROMOSOMES 
A. E. coli/ PLANT CIRCULAR SHUTTLE VECTOR 



PLANT STUFFER 
TELOMERE ^FRAGMENT ^ _ 



PLANT 
TELOMERE 



_L 
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B. YEAST /PLANT CIRCULAR SHUTTLE VECTOR 



TELOMERE k fSSSSlT , TELOMERE 



1 



1 



7777m J 



MARKER 



C. YEAST/ PLANT LINEAR SHUTTLE VECTOR 



PLANT PLANT SELECTABLE PLANT 

TELOMERE ARS MARKER CENTROMERE 
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